Reinforcement Learning of Parameters for Humanoid Rhythmic Walking based on Visual Information

نویسندگان

  • Masaki Ogino
  • Yutaka Katoh
  • Minoru Asada
  • Koh Hosoda
چکیده

This paper presents a method for learning the parameters of rhythmic walking to generate a purposive motion. The controller consists of the two layers. Rhythmic walking is realized by the lower layer controller which adjusts the speed of the phase on the desired trajectory depending on the sensor information. The upper layer controller learns (1) the feasible parameter sets that enable a stable walking for a robot, (2) the causal relationship between the walking parameters to be given to the lower layer controller and the change of the sensor information, and (3) the feasible rhythmic walking parameters by reinforcement learning so that a robot can reach to the goal based on the visual information. The method was examined in the real robot, and it learns to reach the ball and to shoot it into the goal in the context of RoboCupSoccer competition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement learning of humanoid rhythmic walking parameters based on visual information

This paper presents a method for generating vision-based humanoid behaviors by reinforcement learning with rhythmic walking parameters. The walking is stabilized by a rhythmic motion controller such as CPG or neural oscillator. The learning process consists of two stages: first one is building an action space with two parameters (a forward step length and a turning angle) so that infeasible com...

متن کامل

A Humanoid Approaches to the Goal - Reinforcement Learning Based on Rhythmic Walking Parameters

This paper presents a method for generating vision-based humanoid behaviors by reinforcement learning with rhythmic walking parameters. The walking is stabilized by a rhythmic motion controller such as CPG or neural oscillator. The learning process consists of two stages: first one is building an action space with two parameters (a forward step length and a turning angle) so that infeasible com...

متن کامل

Vision-based reinforcement learning for humanoid behavior generation with rhythmic walking parameters

This paper presents a method for generating vision-based humanoid behaviors by reinforcement learning with rhythmic walking parameters. The walking is stabilized by a rhythmic motion controller such as CPG or neural oscillator. The learning process consists of two stages: the first one is building an action space with two parameters (a forward step length and a turning angle) that inhibits comb...

متن کامل

Episodic Reinforcement Learning Control Approach for Biped Walking

This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...

متن کامل

Dynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning

This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learnin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003